Abstract — In information theory the reliability function and its
bounds, describing the exponential behavior of the error probability,
are important quantitative characteristics of the channel performance.
From a more general point of view, these bounds provide certain
measures of distinguishability of a given set of classical states. In the
paper [§] quantum analogs of the random coding and the expurgation lower
bounds were introduced for the case of pure signal states. Here we
discuss the case of general quantum states; in particular, we prove the
expurgation bound conjectured in Q and find the quantum cutoff rate
for arbitrary mixed signal states.

Index Terms — Quantum channel, reliability function, random coding,
expurgation.
I. Introduction



We consider a classical-quantum channel with a finite input alphabet
$\{1, \dots, a\}$ and with arbitrary signal states given by density operators
$S_i$, $i = 1, \dots, a$, in a Hilbert space $\mathcal{H}$. For simplicity of
presentation we take $\mathcal{H}$ finite-dimensional, although with obvious
modifications the results hold for a separable $\mathcal{H}$. The classical
channel corresponds to the case of commuting operators $S_i$, represented by
diagonal matrices $\mathrm{diag}(P(1|i), \dots, P(b|i))$, where $P(j|i)$ is the
channel transition probability.

The product channel of degree $n$ acts in the tensor product
$\mathcal{H}^{\otimes n} = \mathcal{H} \otimes \dots \otimes \mathcal{H}$
of $n$ copies of the space $\mathcal{H}$. Sending a codeword
$w = (i_1, \dots, i_n)$, $i_k \in \{1, \dots, a\}$, produces the signal state
$S_w = S_{i_1} \otimes \dots \otimes S_{i_n}$ in the space
$\mathcal{H}^{\otimes n}$. A code $(W, X)$ of size $M$ in
$\mathcal{H}^{\otimes n}$ is a collection of $M$ pairs
$(w^1, X_1), \dots, (w^M, X_M)$, where $W = \{w^1, \dots, w^M\}$ is a codebook
and $X = \{X_1, \dots, X_M\}$ is a quantum decision rule, i.e. a collection of
positive operators in $\mathcal{H}^{\otimes n}$ satisfying
$\sum_{j=1}^{M} X_j \le I$. The conditional probability of deciding in favor
of message $w^k$, provided that codeword $w^j$ was transmitted, is
$\mathrm{Tr}\, S_{w^j} X_k$; in particular, the probability of a wrong decision
is

$$
P_j(W, X) = 1 - \mathrm{Tr}\, S_{w^j} X_j.
$$

One usually considers the error probabilities

$$
P_{\max}(W, X) = \max_{1 \le j \le M} P_j(W, X)
$$

and

$$
\bar{P}(W, X) = \frac{1}{M} \sum_{j=1}^{M} P_j(W, X).
$$

We shall denote by $P_e(M, n)$ any of the minimal error probabilities
$\min_{W,X} P_{\max}(W, X)$, $\min_{W,X} \bar{P}(W, X)$. It is known that they
are essentially equivalent from the point of view of information theory 0,
see also Sect. 3.

The classical capacity of the classical-quantum channel is defined as the
number $C$ such that $P_e(2^{nR}, n)$ tends to zero as $n \to \infty$ for any
$0 < R < C$ and does not tend to zero if $R > C$. Moreover, if $R < C$ then
$P_e(2^{nR}, n)$ tends to zero exponentially with $n$, and we are interested
in the logarithmic rate of convergence given by the reliability function

$$
E(R) = -\liminf_{n \to \infty} \frac{1}{n} \log P_e(2^{nR}, n), \quad 0 < R < C. \tag{1}
$$

In classical information theory [5] there are lower and upper bounds for
$E(R)$, giving important quantitative characteristics of the channel
performance. From a more general point of view, these bounds provide certain
measures of distinguishability of a given set of classical states. In the paper
|H quantum analogs of the random coding and the expurgation lower bounds
were given for the case of pure signal states $S_i$, represented by rank one
density operators. Here we discuss the general case; in particular, we prove
the expurgation bound conjectured in M.

II. The capacity and the random coding lower bound 



The classical capacity of the channel is given by the formula

$$
C = \max_{\pi} \left[ H\Big( \sum_{i=1}^{a} \pi_i S_i \Big) - \sum_{i=1}^{a} \pi_i H(S_i) \right], \tag{2}
$$

where $H(S) = -\mathrm{Tr}\, S \log S$ is the von Neumann entropy of the state
$S$ and $\pi = \{\pi_i\}$ are probability distributions on the input alphabet
$\{1, \dots, a\}$. This relation was established in [[Uj, f[3"|| , using the
concept of typical subspace [§]. The proofs in the present paper are direct,
making no use of this concept and of the relation (2).

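Proposition 1: For any $\pi$ and $0 < s \le 1$

$$
H\Big( \sum_{i=1}^{a} \pi_i S_i \Big) - \sum_{i=1}^{a} \pi_i H(S_i) \ge \frac{1}{s}\, \mu(\pi, s), \tag{3}
$$

where

$$
\mu(\pi, s) = -\log \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}} \Big)^{1+s}.
$$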
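Proof. Denote by

$$
H(S, T) =
\begin{cases}
\mathrm{Tr}\, S (\log S - \log T), & \text{if } \operatorname{supp} S \subset \operatorname{supp} T, \\
+\infty & \text{otherwise,}
\end{cases}
$$

$$
H_r(S, T) = -\log \mathrm{Tr}\, S^{1-r} T^{r}, \quad 0 < r < 1,
$$

the relative entropy and the Chernoff-Rényi entropy of the density operators
$S$, $T$, respectively (see [[H]]). Since
$H(S, T) = \frac{d}{dr} H_r(S, T) \big|_{r=0}$, $H_0(S, T) = 0$, and
$H_r(S, T)$ is concave in $r$, we have $r H(S, T) \ge H_r(S, T)$. Now

$$
H\Big( \sum_{i=1}^{a} \pi_i S_i \Big) - \sum_{i=1}^{a} \pi_i H(S_i)
= \sum_{i=1}^{a} \pi_i H\Big( S_i, \sum_{l=1}^{a} \pi_l S_l \Big)
\ge -\frac{1}{r} \sum_{i=1}^{a} \pi_i \log \mathrm{Tr}\, S_i^{1-r} \Big( \sum_{l=1}^{a} \pi_l S_l \Big)^{r}
\ge -\frac{1}{r} \log \mathrm{Tr} \sum_{i=1}^{a} \pi_i S_i^{1-r} \Big( \sum_{l=1}^{a} \pi_l S_l \Big)^{r}
$$

by convexity of $-\log$. By the quantum Hölder inequality [11], the argument
of $\log$ is less than or equal to

$$
\Big[ \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i S_i^{1-r} \Big)^{p} \Big]^{1/p}
\Big[ \mathrm{Tr} \Big( \sum_{l=1}^{a} \pi_l S_l \Big)^{rq} \Big]^{1/q}
$$

if $p^{-1} + q^{-1} = 1$, $p > 1$. Putting $p = \frac{1}{1-r}$ (so that
$rq = 1$ and the second factor equals 1), $1 + s = \frac{1}{1-r}$, and using
monotonicity of $\log$, we obtain (3). □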
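As a sanity check of Proposition 1, the following minimal sketch evaluates both sides in the commuting (classical) case only, where $S_i = \mathrm{diag}(P(1|i), \dots, P(b|i))$ and the traces reduce to sums over output letters. It assumes that (3) reads $H(\sum_i \pi_i S_i) - \sum_i \pi_i H(S_i) \ge \mu(\pi, s)/s$. The channel matrix `P`, the distribution `pi` and the function names are illustrative assumptions, not taken from the paper.

```python
import math

def entropy(p):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(x * math.log(x) for x in p if x > 0)

def mutual_information(pi, P):
    """Left-hand side of (3) for commuting states S_i = diag(P[i])."""
    outputs = range(len(P[0]))
    mixture = [sum(w * row[j] for w, row in zip(pi, P)) for j in outputs]
    return entropy(mixture) - sum(w * entropy(row) for w, row in zip(pi, P))

def mu(pi, P, s):
    """mu(pi, s) = -log sum_j (sum_i pi_i P(j|i)^(1/(1+s)))^(1+s)."""
    r = 1.0 / (1.0 + s)
    outputs = range(len(P[0]))
    return -math.log(sum(
        sum(w * row[j] ** r for w, row in zip(pi, P)) ** (1.0 + s)
        for j in outputs))

# Hypothetical binary channel and input distribution (assumptions).
P = [[0.9, 0.1], [0.2, 0.8]]
pi = [0.5, 0.5]
for s in (0.1, 0.25, 0.5, 0.75, 1.0):
    print(s, mu(pi, P, s) / s, "<=", mutual_information(pi, P))
```

For this example every printed ratio $\mu(\pi, s)/s$ should stay below the mutual information, in line with (3).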

Assume now that the words in the codebook $W$ are chosen at random,
independently, and with the probability distribution

$$
\mathsf{P}\{ w = (i_1, \dots, i_n) \} = \pi_{i_1} \cdots \pi_{i_n} \tag{4}
$$

for each word. We shall denote expectations with respect to this probability
distribution by the symbol $\mathsf{E}$. In || we conjectured the following
random coding bound for the error probability

$$
\mathsf{E} \min_{X} \bar{P}(W, X) \le c \inf_{0 < s \le 1} (M - 1)^{s}
\Big[ \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}} \Big)^{1+s} \Big]^{n}. \tag{5}
$$



The bound (5) holds for pure states $S_i$, in which case
$S_i^{\frac{1}{1+s}} = S_i$ and $c = 2$ [||]. For commuting $S_i$ it reduces to
the classical bound of Theorem 5.6.2 [5] with $c = 1$. By putting
$M = 2^{nR}$, it implies the lower bound for the reliability function

$$
E(R) \ge \max_{\pi} \sup_{0 < s \le 1} [\mu(\pi, s) - sR] \equiv E_r(R). \tag{6}
$$

This can be calculated explicitly for quantum binary and Gaussian pure state
channels |J. A remarkable feature of the classical case is that there exists
an upper bound (the sphere-packing bound) which coincides with $E_r(R)$
for high rates, and thus gives the exact expression for $E(R)$. In the quantum
case no useful upper bound for $E(R)$ is known yet (see, however, [II] for an
incomplete analog of the sphere-packing bound).

We shall prove a general inequality for the error probabilities $P_j(W, X)$,
which implies (5) for $s = 1$ with $c = 1$, and will be used in the next
Section to obtain the expurgation bound. Moreover, the first part of the
argument will be used for an alternative operator proof of (5) in the case of
commuting $S_i$, indicating clearly at which point commutativity comes into
play. The proof of (5) in full generality remains an open question.

Lemma: For any collection $W$ of codewords there is a decision rule $X$
such that

$$
P_j(W, X) \le \sum_{l \ne j} \mathrm{Tr} \sqrt{S_{w^j}} \sqrt{S_{w^l}}, \quad j = 1, \dots, M. \tag{7}
$$

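Proof. By making a small perturbation of the density operators $S_{w^j}$, we
can assume that they are nondegenerate. We choose the following suboptimal
decision rule

$$
X_j = \Big( \sum_{l=1}^{M} S_{w^l}^{r} \Big)^{-1/2} S_{w^j}^{r} \Big( \sum_{l=1}^{M} S_{w^l}^{r} \Big)^{-1/2}, \tag{8}
$$

where $r$ is a real parameter, $0 < r \le 1$. This gives

$$
P_j(W, X) = 1 - \mathrm{Tr}\, S_{w^j} A_j^{*} A_j, \tag{9}
$$

where $A_j = S_{w^j}^{r/2} \big( \sum_{l=1}^{M} S_{w^l}^{r} \big)^{-1/2}$. Using
the Cauchy-Schwarz inequality

$$
| \mathrm{Tr}\, S_{w^j} A_j |^{2} \le \mathrm{Tr}\, S_{w^j} A_j^{*} A_j,
$$

we obtain

$$
P_j(W, X) \le 2 (1 - \mathrm{Tr}\, S_{w^j} A_j)
= 2 \Big[ 1 - \mathrm{Tr}\, S_{w^j}^{1 + r/2} \Big( \sum_{l=1}^{M} S_{w^l}^{r} \Big)^{-1/2} \Big]. \tag{10}
$$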
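Let $S_{w^j} = \sum_i \lambda_i^j |e_i^j\rangle \langle e_i^j|$ be the spectral
decomposition of the operator $S_{w^j}$; then (10) takes the form

$$
P_j(W, X) \le \sum_i \lambda_i^j \langle e_i^j |\, 2 \Big[ I - \Big( \sum_{l=1}^{M} \frac{S_{w^l}^{r}}{(\lambda_i^j)^{r}} \Big)^{-1/2} \Big] | e_i^j \rangle. \tag{11}
$$

Applying the inequality

$$
2 (1 - x^{-1/2}) \le x - 1, \quad x > 0, \tag{12}
$$

we obtain

\begin{align}
2 \Big[ I - \Big( \sum_{l=1}^{M} \frac{S_{w^l}^{r}}{(\lambda_i^j)^{r}} \Big)^{-1/2} \Big]
&\le \sum_{l=1}^{M} \frac{S_{w^l}^{r}}{(\lambda_i^j)^{r}} - I \tag{13} \\
&= (\lambda_i^j)^{-r} \sum_{l \ne j} S_{w^l}^{r} + \Big[ \frac{S_{w^j}^{r}}{(\lambda_i^j)^{r}} - I \Big]. \tag{14}
\end{align}

By substituting this into (11) and noting that
$\langle e_i^j | S_{w^j}^{r} | e_i^j \rangle = (\lambda_i^j)^{r}$, we see that
for $0 < r \le 1$

$$
P_j(W, X) \le \sum_i (\lambda_i^j)^{1-r} \langle e_i^j | \sum_{l \ne j} S_{w^l}^{r} | e_i^j \rangle
= \sum_{l \ne j} \mathrm{Tr}\, S_{w^j}^{1-r} S_{w^l}^{r}; \tag{15}
$$

in particular, for $r = 1/2$ we obtain (7). By a continuity argument we can
drop the assumption of nondegeneracy of the operators $S_{w^j}$. □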
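The Lemma can be checked numerically in the smallest nontrivial setting. The sketch below assumes $n = 1$, $M = 2$, real symmetric $2 \times 2$ density matrices, the decision rule (8) with $r = 1/2$, and the bound (7) read as $P_j \le \sum_{l \ne j} \mathrm{Tr} \sqrt{S_{w^j}} \sqrt{S_{w^l}}$. The two example states and all function names are illustrative assumptions. The matrix square root uses the closed form $\sqrt{A} = (A + \sqrt{\det A}\, I) / \sqrt{\mathrm{Tr} A + 2 \sqrt{\det A}}$, valid for nonzero $2 \times 2$ positive semidefinite matrices.

```python
import math

def mat_mul(A, B):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def trace(A):
    return A[0][0] + A[1][1]

def det(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

def psd_sqrt(A):
    """Square root of a nonzero 2x2 positive semidefinite matrix."""
    d = math.sqrt(max(det(A), 0.0))
    t = math.sqrt(trace(A) + 2.0 * d)
    return [[(A[0][0] + d) / t, A[0][1] / t],
            [A[1][0] / t, (A[1][1] + d) / t]]

def inverse(A):
    """Inverse of an invertible 2x2 matrix."""
    d = det(A)
    return [[A[1][1] / d, -A[0][1] / d], [-A[1][0] / d, A[0][0] / d]]

def lemma_check(S1, S2):
    """Error probabilities of rule (8) with r = 1/2 and the bound (7)."""
    R1, R2 = psd_sqrt(S1), psd_sqrt(S2)
    total = [[R1[i][j] + R2[i][j] for j in range(2)] for i in range(2)]
    N = inverse(psd_sqrt(total))          # (sqrt(S1) + sqrt(S2))^(-1/2)
    errors = []
    for S, R in ((S1, R1), (S2, R2)):
        X = mat_mul(mat_mul(N, R), N)     # decision operator X_j
        errors.append(1.0 - trace(mat_mul(S, X)))
    bound = trace(mat_mul(R1, R2))        # Tr sqrt(S1) sqrt(S2)
    return errors, bound

# Two hypothetical full-rank qubit states (assumptions).
S1 = [[0.9, 0.1], [0.1, 0.1]]
S2 = [[0.2, -0.1], [-0.1, 0.8]]
errors, bound = lemma_check(S1, S2)
print(errors, bound)
```

Both error probabilities should come out no larger than the overlap $\mathrm{Tr} \sqrt{S_1} \sqrt{S_2}$. The rule used here is the suboptimal one from the proof, not the optimal measurement.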
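Corollary: For any collection of states $S_i$, $i = 1, \dots, a$,

$$
C \ge -\log \min_{\pi} \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i \sqrt{S_i} \Big)^{2}. \tag{16}
$$

Proof. Let us apply random coding. Then from (7), using the fact that
the words are i.i.d., we find

$$
\mathsf{E} \min_{X} \bar{P}(W, X) \le (M - 1)\, \mathrm{Tr} \big( \mathsf{E} \sqrt{S_w} \big)^{2}. \tag{17}
$$

The expectation is

$$
\mathrm{Tr} \big( \mathsf{E} \sqrt{S_w} \big)^{2}
= \mathrm{Tr} \Big[ \Big( \sum_{i=1}^{a} \pi_i \sqrt{S_i} \Big)^{\otimes n} \Big]^{2}
= \Big[ \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i \sqrt{S_i} \Big)^{2} \Big]^{n},
$$

which gives (5) with $s = 1$, $c = 1$. Choosing $M = 2^{nR}$, we get (16). □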

Remark: The above proof did not involve the quantum coding theorem
(2). On the other hand, by letting $s = 1$ in (3) we obtain the inequality

$$
H\Big( \sum_{i=1}^{a} \pi_i S_i \Big) - \sum_{i=1}^{a} \pi_i H(S_i)
\ge -\log \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i \sqrt{S_i} \Big)^{2}, \tag{18}
$$

which, combined with (2), also gives (16).

The quantity on the right-hand side of (16) is a quantum analog of the
cutoff rate widely used in applications of information theory (see 0). Since it
is easier to calculate than the capacity (the minimum over $\pi$ can be
evaluated explicitly), it can be used as a practical lower bound. In
particular, consider a quantum-quantum channel $\Phi$, which is a completely
positive trace preserving map of states, and substitute
$S_i = \Phi^{\otimes n}(T_i)$, where $T_i$ are the input states in
$\mathcal{H}^{\otimes n}$ to be optimized afterwards. Then (16) implies a lower
bound for the classical capacity of the channel, which might be relevant to
the problem of additivity of the capacity with respect to entangled inputs
@ , H , although the problem of additivity of the cutoff rate is itself by no
means simple.
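To illustrate how the cutoff rate compares with the capacity, here is a small sketch for the binary pure-state channel, chosen because both quantities have closed forms there. It assumes two pure states with overlap $\epsilon = |\langle \psi_0 | \psi_1 \rangle|$, so that $\sqrt{S_i} = S_i$ and $\mathrm{Tr} (p S_0 + (1-p) S_1)^2 = p^2 + (1-p)^2 + 2p(1-p)\epsilon^2$, and uses the known expression for the capacity of this channel as the binary entropy of $(1+\epsilon)/2$. Logarithms are base 2 here, and the function names are illustrative.

```python
import math

def cutoff_rate_binary_pure(eps, grid=1001):
    """-log2 min_p Tr(p S0 + (1-p) S1)^2 for pure states with overlap eps.

    The minimum over p is taken on a uniform grid that contains p = 1/2.
    """
    def purity(p):
        return p * p + (1 - p) * (1 - p) + 2 * p * (1 - p) * eps * eps
    best = min(purity(k / (grid - 1)) for k in range(grid))
    return -math.log2(best)

def capacity_binary_pure(eps):
    """Binary entropy (bits) of the eigenvalues (1 +/- eps)/2 of (S0 + S1)/2."""
    lam = (1 + eps) / 2
    return -sum(x * math.log2(x) for x in (lam, 1 - lam) if x > 0)

for eps in (0.0, 0.2, 0.5, 0.8):
    print(eps, cutoff_rate_binary_pure(eps), capacity_binary_pure(eps))
```

In this example the minimum is attained at the uniform distribution, giving $-\log_2 \frac{1 + \epsilon^2}{2}$, which stays below the capacity as (16) requires.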

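Finally let us show how the bound (5) can be obtained for commuting $S_i$
along these lines. Taking the expectation of (11), we get

$$
\mathsf{E} P_j(W, X) \le \mathsf{E} \sum_i \lambda_i^j \langle e_i^j |\, 2\, \mathsf{E} \Big\{ \Big[ I - \Big( \sum_{l=1}^{M} \frac{S_{w^l}^{r}}{(\lambda_i^j)^{r}} \Big)^{-1/2} \Big] \Big|\, w^j \Big\} | e_i^j \rangle, \tag{19}
$$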



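where inside is the conditional expectation with respect to the fixed word
$w^j$. Taking this conditional expectation in (14), we get

\begin{align}
2\, \mathsf{E} \Big\{ \Big[ I - \Big( \sum_{l=1}^{M} \frac{S_{w^l}^{r}}{(\lambda_i^j)^{r}} \Big)^{-1/2} \Big] \Big|\, w^j \Big\}
&\le (\lambda_i^j)^{-r} (M - 1)\, \mathsf{E} S_w^{r} \tag{20} \\
&\quad + \Big[ \frac{S_{w^j}^{r}}{(\lambda_i^j)^{r}} - I \Big]. \tag{21}
\end{align}

On the other hand, the left hand side is less than or equal to $2I$. If all
operators commute, all the matrices are diagonal in the basis
$\{ | e_i \rangle \}$, which is the same for all $j$, and we can use the
inequalities $\min(2, x) \le 2 x^{s}$ and $(x + y)^{s} \le x^{s} + y^{s}$,
valid for $x, y \ge 0$ and $0 \le s \le 1$, to obtain that the left hand side
does not exceed

$$
2 \Big\{ (\lambda_i^j)^{-rs} (M - 1)^{s} \big( \mathsf{E} S_w^{r} \big)^{s}
+ \Big[ \frac{S_{w^j}^{r}}{(\lambda_i^j)^{r}} - I \Big]^{s} \Big\}.
$$

The second term has zero matrix element on $| e_i \rangle$, since
$\langle e_i | S_{w^j}^{r} | e_i \rangle = (\lambda_i^j)^{r}$.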



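Substituting this into (19), we obtain

\begin{align}
\mathsf{E} P_j(W, X)
&\le \mathsf{E} \sum_i (\lambda_i^j)^{1 - rs} \langle e_i |\, 2 (M - 1)^{s} \big( \mathsf{E} S_w^{r} \big)^{s} | e_i \rangle \tag{22} \\
&= 2 (M - 1)^{s}\, \mathrm{Tr} \big( \mathsf{E} S_w^{1 - rs} \big) \big( \mathsf{E} S_w^{r} \big)^{s}. \tag{23}
\end{align}

Choosing $r = \frac{1}{1+s}$, so that $1 - rs = r$, this gives

$$
\mathsf{E} P_j(W, X) \le 2 (M - 1)^{s}\, \mathrm{Tr} \Big( \mathsf{E} S_w^{\frac{1}{1+s}} \Big)^{1+s}, \tag{24}
$$

whence (5) follows.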

Moreover, we can omit the factor 2 if we use commutativity from the
start, avoid the Cauchy-Schwarz inequality and use
$1 - x^{-1} \le x - 1$, $x > 0$, instead of (12).



III. The expurgation lower bound 



As is well known in classical information theory, for low rates $R$
codes with high probability of error begin to dominate in the random coding
ensemble. In order to reduce the influence of choosing such bad codes, an
ingenious expurgation technique has been developed, see [5], Ch. 5.7.

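Theorem: For arbitrary density operators $S_i$ the expurgation bound holds:

$$
\min_{W, X} P_{\max}(W, X) \le \inf_{s \ge 1}
\Big[ 4 (M - 1) \Big( \sum_{i,k=1}^{a} \pi_i \pi_k \big( \mathrm{Tr} \sqrt{S_i} \sqrt{S_k} \big)^{1/s} \Big)^{n} \Big]^{s}. \tag{25}
$$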



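Proof. Using (7) and the inequality
$\big( \sum_i a_i \big)^{r} \le \sum_i a_i^{r}$, $0 < r \le 1$, we obtain
for $s \ge 1$

$$
\big( P_j(W, X) \big)^{1/s}
\le \Big( \sum_{l \ne j} \mathrm{Tr} \sqrt{S_{w^j}} \sqrt{S_{w^l}} \Big)^{1/s}
\le \sum_{l \ne j} \big( \mathrm{Tr} \sqrt{S_{w^j}} \sqrt{S_{w^l}} \big)^{1/s}. \tag{26}
$$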

We again apply Shannon's random coding scheme, assuming that the
codewords are chosen at random, independently and with the probability
distribution (4) for each word. We start with an ensemble of codes with
$M' = 2M - 1$ codewords. Then according to the Lemma from Ch. 5.7 [5]
(which is a simple corollary of the central limit theorem) there exists a code
in the ensemble of codes with $M' = 2M - 1$ codewords, for which at least $M$
codewords satisfy

$$
P_j(W, X) \le \big[ 2\, \mathsf{E} P_j(W, X)^{1/s} \big]^{s} \tag{27}
$$

for arbitrary $s \ge 1$ (without loss of generality we can assume that (27)
holds for $j = 1, \dots, M$). Then taking into account that
$M' - 1 = 2(M - 1)$, we have from (26)

$$
P_j(W, X) \le \big[ 4 (M - 1)\, \mathsf{E} \big( \mathrm{Tr} \sqrt{S_w} \sqrt{S_{w'}} \big)^{1/s} \big]^{s}. \tag{28}
$$

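Using the fact that the words are i.i.d., we find

\begin{align}
\mathsf{E} \big( \mathrm{Tr} \sqrt{S_w} \sqrt{S_{w'}} \big)^{1/s}
&= \sum_{i_1, \dots, i_n;\, j_1, \dots, j_n} \pi_{i_1} \cdots \pi_{i_n} \pi_{j_1} \cdots \pi_{j_n}
\big( \mathrm{Tr} \sqrt{S_{i_1}} \sqrt{S_{j_1}} \big)^{1/s} \cdots \big( \mathrm{Tr} \sqrt{S_{i_n}} \sqrt{S_{j_n}} \big)^{1/s} \nonumber \\
&= \Big[ \sum_{i,k=1}^{a} \pi_i \pi_k \big( \mathrm{Tr} \sqrt{S_i} \sqrt{S_k} \big)^{1/s} \Big]^{n}, \tag{29}
\end{align}

whence the theorem follows. □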

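Again, it is convenient to introduce the function

$$
\tilde{\mu}(\pi, s) = -s \log \sum_{i,k=1}^{a} \pi_i \pi_k \big( \mathrm{Tr} \sqrt{S_i} \sqrt{S_k} \big)^{1/s};
$$

then taking $M = 2^{nR}$, we obtain the expurgation lower bound for the
reliability function

$$
E(R) \ge \max_{\pi} \sup_{s \ge 1} \big( \tilde{\mu}(\pi, s) - sR \big) \equiv E_{ex}(R).
$$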
The function $\tilde{\mu}(\pi, s)$ is concave (see Appendix), increasing from
the value

$$
\tilde{\mu}(\pi, 1) = \mu(\pi, 1) = -\log \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i \sqrt{S_i} \Big)^{2}
$$

for $s = 1$ to

$$
\tilde{\mu}(\pi, \infty) = -\sum_{i,k=1}^{a} \pi_i \pi_k \log \mathrm{Tr} \sqrt{S_i} \sqrt{S_k}
$$

(which may be infinite).
By introducing

$$
E_{ex}(\pi, R) = \sup_{s \ge 1} [ \tilde{\mu}(\pi, s) - sR ], \tag{30}
$$

we can investigate the behavior of $E_{ex}(\pi, R)$ as in the classical case.
Namely, for $0 < R \le \tilde{\mu}'(\pi, 1)$, where
$\tilde{\mu}'(\pi, 1) \le \tilde{\mu}(\pi, 1)$ (see Appendix), the function
$E_{ex}(\pi, R)$ is concave, decreasing from

$$
E_{ex}(\pi, +0) = \tilde{\mu}(\pi, \infty) \tag{31}
$$

to $E_{ex}(\pi, \tilde{\mu}'(\pi, 1)) = \tilde{\mu}(\pi, 1) - \tilde{\mu}'(\pi, 1)$.
In the interval $\tilde{\mu}'(\pi, 1) \le R \le \tilde{\mu}(\pi, 1)$ it is a
linear function

$$
E_{ex}(\pi, R) = \tilde{\mu}(\pi, 1) - R,
$$

and $E_{ex}(\pi, R) = 0$ for $\tilde{\mu}(\pi, 1) \le R < C$.
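The behavior of $E_{ex}(\pi, R)$ described above can be explored numerically. The sketch below assumes $\tilde{\mu}(\pi, s) = -s \log \sum_{i,k} \pi_i \pi_k f_{ik}^{1/s}$ with $f_{ik} = \mathrm{Tr} \sqrt{S_i} \sqrt{S_k}$ supplied as a matrix of strictly positive overlaps, and approximates the supremum in (30) by a maximum over a finite grid in $s$. The grid limits, the example overlaps (two pure states with $|\langle \psi_0 | \psi_1 \rangle|^2 = 0.25$) and the function names are assumptions for illustration. Natural logarithms are used.

```python
import math

def mu_tilde(pi, f, s):
    """mu~(pi, s) = -s log sum_{i,k} pi_i pi_k f[i][k]^(1/s)."""
    a = len(pi)
    total = sum(pi[i] * pi[k] * f[i][k] ** (1.0 / s)
                for i in range(a) for k in range(a))
    return -s * math.log(total)

def mu_tilde_inf(pi, f):
    """Limit s -> infinity: -sum_{i,k} pi_i pi_k log f[i][k] (needs f > 0)."""
    a = len(pi)
    return -sum(pi[i] * pi[k] * math.log(f[i][k])
                for i in range(a) for k in range(a))

def e_ex(pi, f, R, s_max=200.0, steps=4000):
    """Grid approximation of sup_{s >= 1} [mu~(pi, s) - s R]."""
    grid = (1.0 + (s_max - 1.0) * t / steps for t in range(steps + 1))
    return max(mu_tilde(pi, f, s) - s * R for s in grid)

# Hypothetical example: uniform inputs, overlaps of two pure states.
pi = [0.5, 0.5]
f = [[1.0, 0.25], [0.25, 1.0]]
print([round(mu_tilde(pi, f, s), 4) for s in (1, 2, 5, 20)], mu_tilde_inf(pi, f))
print([round(e_ex(pi, f, R), 4) for R in (0.01, 0.1, 0.3)])
```

For this example $\tilde{\mu}$ should increase with $s$ toward $\log 2$, and at $R = 0.3$, which lies in the linear segment, the grid maximum is attained at $s = 1$.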

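Finally, let us evaluate the limiting value $E(+0)$ of the reliability function
at zero rate. We recall the notation $|A| = \sqrt{A^{*} A}$, where $A$ is an
operator in $\mathcal{H}$.

Proposition 2: If $S_i S_k \ne 0$ for any $1 \le i, k \le a$ then

$$
-\min_{\pi} \sum_{i,k=1}^{a} \pi_i \pi_k \log \mathrm{Tr} \sqrt{S_i} \sqrt{S_k}
\le E(+0)
\le -2 \min_{\pi} \sum_{i,k=1}^{a} \pi_i \pi_k \log \mathrm{Tr} \big| \sqrt{S_i} \sqrt{S_k} \big|. \tag{32}
$$

If $S_i S_k = 0$ for some $i, k$, then $E(+0) = \infty$.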

Proof. The proof is a generalization of that given in 0] for the case of pure
states. Note that in this case the left and right hand sides of (32) coincide,
giving the exact value of $E(+0)$.
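From (31) we see that $E(+0)$ is greater than or equal to the left hand
side of (32). On the other hand,

$$
P_{\max}(W, X) \ge \max_{w \ne w'} \min_{X} \bar{P}(\{ S_w, S_{w'} \}, X),
$$

where $w, w'$ are arbitrary two codewords from $W$. The minimal error
probability of discrimination between the two equiprobable states
$S_w$, $S_{w'}$ is

$$
\min_{X} \bar{P}(\{ S_w, S_{w'} \}, X) = \frac{1}{2} \Big[ 1 - \frac{1}{2} \mathrm{Tr}\, | S_w - S_{w'} | \Big]
$$

(cf. 0). In [7] the following estimates were established for the trace norm
of the difference $S_1 - S_2$ of any two density operators $S_1$, $S_2$:

$$
2 \big( 1 - \mathrm{Tr} \sqrt{S_1} \sqrt{S_2} \big) \le \mathrm{Tr}\, | S_1 - S_2 |
\le 2 \sqrt{ 1 - \big( \mathrm{Tr} \sqrt{S_1} \sqrt{S_2} \big)^{2} }.
$$

The proof of the second inequality can be easily modified to obtain

$$
\mathrm{Tr}\, | S_1 - S_2 | \le 2 \sqrt{ 1 - \big( \mathrm{Tr}\, \big| \sqrt{S_1} \sqrt{S_2} \big| \big)^{2} }
$$

(see also [[§]). Therefore we get

$$
\min_{X} \bar{P}(\{ S_w, S_{w'} \}, X)
\ge \frac{1}{2} \Big[ 1 - \sqrt{ 1 - \big( \mathrm{Tr}\, \big| \sqrt{S_w} \sqrt{S_{w'}} \big| \big)^{2} } \Big]
\ge \frac{1}{4} \big( \mathrm{Tr}\, \big| \sqrt{S_w} \sqrt{S_{w'}} \big| \big)^{2}
$$

and

$$
P_{\max}(W, X) \ge \max_{w \ne w'} \frac{1}{4} \big( \mathrm{Tr}\, \big| \sqrt{S_w} \sqrt{S_{w'}} \big| \big)^{2}.
$$

It follows that

$$
E(+0) \le -\lim_{n \to \infty} \frac{2}{n} \max_{w \ne w'} \log \mathrm{Tr}\, \big| \sqrt{S_w} \sqrt{S_{w'}} \big|.
$$

Repeating the argument from the proof of Proposition 3 from H , we obtain the
second inequality in (32). □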
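The trace-norm estimates used in this proof can be checked on a concrete pair of states. The sketch below is restricted, as an assumption, to real symmetric $2 \times 2$ density matrices, where everything has a closed form: $\mathrm{Tr}|S_1 - S_2|$ is twice the positive eigenvalue of the traceless difference, and $\mathrm{Tr}|K|$ for $K = \sqrt{S_1} \sqrt{S_2}$ equals $\sqrt{\mathrm{Tr} K^{*}K + 2 |\det K|}$, the sum of the two singular values. The example states and function names are illustrative.

```python
import math

def psd_sqrt(A):
    """Square root of a nonzero 2x2 PSD matrix:
    (A + sqrt(det A) I) / sqrt(tr A + 2 sqrt(det A))."""
    d = math.sqrt(max(A[0][0] * A[1][1] - A[0][1] * A[1][0], 0.0))
    t = math.sqrt(A[0][0] + A[1][1] + 2.0 * d)
    return [[(A[0][0] + d) / t, A[0][1] / t],
            [A[1][0] / t, (A[1][1] + d) / t]]

def overlaps_and_distance(S1, S2):
    """Return (Tr sqrt(S1) sqrt(S2), Tr |sqrt(S1) sqrt(S2)|, Tr |S1 - S2|)."""
    R1, R2 = psd_sqrt(S1), psd_sqrt(S2)
    K = [[sum(R1[i][k] * R2[k][j] for k in range(2)) for j in range(2)]
         for i in range(2)]
    a = K[0][0] + K[1][1]
    frob = sum(K[i][j] ** 2 for i in range(2) for j in range(2))
    det_k = K[0][0] * K[1][1] - K[0][1] * K[1][0]
    f = math.sqrt(frob + 2.0 * abs(det_k))   # sum of singular values of K
    # S1 - S2 is symmetric and traceless, so its eigenvalues are +/- hypot.
    t = 2.0 * math.hypot(S1[0][0] - S2[0][0], S1[0][1] - S2[0][1])
    return a, f, t

# Two hypothetical unit-trace qubit states (assumptions).
S1 = [[0.9, 0.1], [0.1, 0.1]]
S2 = [[0.2, -0.1], [-0.1, 0.8]]
a, f, t = overlaps_and_distance(S1, S2)
p_min = 0.5 * (1.0 - 0.5 * t)   # minimal error probability for equiprobable states
print(2 * (1 - a), t, 2 * math.sqrt(1 - f * f), p_min, 0.25 * f * f)
```

The printed values should be ordered as $2(1 - a) \le t \le 2\sqrt{1 - f^2}$, and the minimal error probability should be at least $f^2/4$, which is the step that yields the upper bound on $E(+0)$.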
Appendix
Properties of the functions $\tilde{\mu}(\pi, s)$, $\mu(\pi, s)$.

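1. Let us calculate the derivatives of the function $\tilde{\mu}(\pi, s)$ with
respect to $s$ and show that the first derivative is nonnegative, while the
second is nonpositive. Denoting
$F_{ik}(s) = \big( \mathrm{Tr} \sqrt{S_i} \sqrt{S_k} \big)^{1/s}$,
$F(s) = \sum_{i,k=1}^{a} \pi_i \pi_k F_{ik}(s)$, we have (with log denoting
natural logarithms in what follows)

\begin{align*}
\tilde{\mu}'(\pi, s) &= -\log F(s) - s F(s)^{-1} F'(s) \\
&= F(s)^{-1} \sum_{i,k=1}^{a} \pi_i \pi_k F_{ik}(s) \big( \log F_{ik}(s) - \log F(s) \big).
\end{align*}

Using the inequality

$$
x (\log x - \log y) \ge x - y, \quad x, y > 0 \tag{33}
$$

(cf. Proposition 3.16 of [11]), we see that indeed
$\tilde{\mu}'(\pi, s) \ge 0$. Taking into account that $F'(1) \ge 0$ (each
$F_{ik}(s)$ is nondecreasing in $s$, since
$\mathrm{Tr} \sqrt{S_i} \sqrt{S_k} \le 1$), we also obtain
$\tilde{\mu}'(\pi, 1) \le \tilde{\mu}(\pi, 1)$.
The second derivative

$$
\tilde{\mu}''(\pi, s) = \big( s F(s)^{2} \big)^{-1}
\Big[ \Big( \sum_{i,k=1}^{a} \pi_i \pi_k F_{ik}(s) \log F_{ik}(s) \Big)^{2}
- \sum_{i,k=1}^{a} \pi_i \pi_k F_{ik}(s) \big( \log F_{ik}(s) \big)^{2} \sum_{i,k=1}^{a} \pi_i \pi_k F_{ik}(s) \Big]
$$

is nonpositive by the Cauchy-Schwarz inequality.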
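The derivative formula of item 1 can be compared with finite differences. The sketch below assumes $\tilde{\mu}(\pi, s) = -s \log F(s)$ with $F_{ik}(s) = f_{ik}^{1/s}$ and $\tilde{\mu}'(\pi, s) = F(s)^{-1} \sum_{i,k} \pi_i \pi_k F_{ik}(s) (\log F_{ik}(s) - \log F(s))$. The overlap matrix `f`, the step sizes and the function names are illustrative assumptions.

```python
import math

def mu_tilde(pi, f, s):
    """mu~(pi, s) = -s log sum_{i,k} pi_i pi_k f[i][k]^(1/s)."""
    a = len(pi)
    return -s * math.log(sum(pi[i] * pi[k] * f[i][k] ** (1.0 / s)
                             for i in range(a) for k in range(a)))

def mu_tilde_prime(pi, f, s):
    """Closed-form first derivative in s (requires all f[i][k] > 0)."""
    a = len(pi)
    pairs = [(pi[i] * pi[k], f[i][k] ** (1.0 / s))
             for i in range(a) for k in range(a)]
    F = sum(w * x for w, x in pairs)
    return sum(w * x * (math.log(x) - math.log(F)) for w, x in pairs) / F

# Hypothetical example: uniform inputs, overlaps of two pure states.
pi = [0.5, 0.5]
f = [[1.0, 0.25], [0.25, 1.0]]
s, h = 2.0, 1e-4
central = (mu_tilde(pi, f, s + h) - mu_tilde(pi, f, s - h)) / (2 * h)
second = mu_tilde(pi, f, s + 0.01) - 2 * mu_tilde(pi, f, s) + mu_tilde(pi, f, s - 0.01)
print(mu_tilde_prime(pi, f, s), central, second)
print(mu_tilde_prime(pi, f, 1.0), "<=", mu_tilde(pi, f, 1.0))
```

For this example the closed form should agree with the central difference, the second difference should be negative, and the slope at $s = 1$ should not exceed $\tilde{\mu}(\pi, 1)$.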
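2. Let us show that $\mu'(\pi, s) \ge 0$. Introducing the operator valued
function

$$
A(s) = \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}}
$$

and letting $G(s) = \mathrm{Tr}\, A(s)^{1+s}$, we have
$\mu(\pi, s) = -\log G(s)$, so that

$$
\mu'(\pi, s) = -G(s)^{-1} G'(s).
$$

To calculate $G'(s)$ we use a generalization of formula (3.17) from [11], namely

$$
\frac{d}{ds} \mathrm{Tr}\, f(s, A(s)) = \mathrm{Tr}\, f_s(s, A(s)) + \mathrm{Tr}\, f_A(s, A(s)) A'(s). \tag{34}
$$

We then obtain

$$
G'(s) = -\mathrm{Tr}\, A(s)^{s} \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}} \Big[ \log S_i^{\frac{1}{1+s}} - \log A(s) \Big]. \tag{35}
$$

From

$$
y^{s} x (\log x - \log y) \ge y^{s} (x - y), \quad x, y > 0, \tag{36}
$$

we have therefore by Proposition 3.16 from [11]

$$
\frac{d}{ds} G(s) \le -\mathrm{Tr}\, A(s)^{s} \sum_{i=1}^{a} \pi_i \Big[ S_i^{\frac{1}{1+s}} - A(s) \Big] = 0.
$$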

In the classical case the function $\mu(\pi, s)$ is concave in $s$ [5],
Appendix 5B. We conjecture that this property also holds in the quantum case,
and we postpone this problem to a separate investigation.

3. To compute $E_r(R)$ according to the definition (6) it is expedient to
perform the maximization with respect to $\pi$ first. Maximizing
$\mu(\pi, s)$ is equivalent to minimizing

$$
G(\pi, s) = \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}} \Big)^{1+s}.
$$

By Proposition 3.1 of [yjj this function is convex in $\pi$, which makes the
general criterion of Theorem 4.4.1 from [5] applicable.
From this theorem it follows that a probability distribution $\pi$ minimizes
$G(\pi, s)$ if and only if there exists a constant $c$ such that

$$
\frac{\partial G(\pi, s)}{\partial \pi_j} \ge c, \quad j = 1, \dots, a,
$$

with equality for those $j$ for which $\pi_j > 0$. After some computation, this
amounts to

$$
\mathrm{Tr}\, S_j^{\frac{1}{1+s}} \Big( \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}} \Big)^{s}
\ge \mathrm{Tr} \Big( \sum_{i=1}^{a} \pi_i S_i^{\frac{1}{1+s}} \Big)^{1+s}, \tag{37}
$$

with the corresponding equalities.
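The optimality condition can be tested in the commuting (classical) case, where (37) reduces to a scalar condition on the transition probabilities. The sketch below assumes $S_i = \mathrm{diag}(P(1|i), \dots, P(b|i))$, so that the condition reads $\sum_j P(j|k)^{\frac{1}{1+s}} \alpha_j^{s} \ge \sum_j \alpha_j^{1+s}$ with $\alpha_j = \sum_i \pi_i P(j|i)^{\frac{1}{1+s}}$. The binary symmetric channel used as the example and the function name `kkt_gaps` are illustrative assumptions.

```python
def kkt_gaps(pi, P, s):
    """Left-hand side minus right-hand side of (37) for each input letter,
    in the commuting case S_i = diag(P[i])."""
    r = 1.0 / (1.0 + s)
    a, b = len(P), len(P[0])
    alpha = [sum(pi[i] * P[i][j] ** r for i in range(a)) for j in range(b)]
    rhs = sum(x ** (1.0 + s) for x in alpha)
    return [sum(P[k][j] ** r * alpha[j] ** s for j in range(b)) - rhs
            for k in range(a)]

# Hypothetical binary symmetric channel with crossover probability 0.1.
P = [[0.9, 0.1], [0.1, 0.9]]
print(kkt_gaps([0.5, 0.5], P, 0.5))   # uniform inputs: both gaps vanish
print(kkt_gaps([0.8, 0.2], P, 0.5))   # skewed inputs: one gap is negative
```

For the symmetric channel the uniform distribution satisfies (37) with equality for both letters. A skewed distribution violates it for the less likely letter, so it does not minimize $G(\pi, s)$.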

By using this necessary and sufficient condition one shows, as in Example
4 of Sec. 5.6 [5], that for two parallel channels 1 and 2

$$
\max_{\pi_{12}} \mu_{12}(\pi_{12}, s) = \max_{\pi_1} \mu_1(\pi_1, s) + \max_{\pi_2} \mu_2(\pi_2, s),
$$

implying a corresponding additivity property for $E_r(R)$. This gives an answer
to a question posed by R. Ahlswede. It is worthwhile to recall that the
additivity property does not hold in general for $\tilde{\mu}(\pi, s)$ even in
the classical case [5], Problem 5.26.